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INTRODUCTION 


A coronavirus, an RNA virus, has been identified in 
humans as the pathogen responsible for the recent world- 
wide Severe Acute Respiratory Syndrome (SARS) outbreak. 
Rapid study of the critical SARS enzymes will provide useful 
inhibitors for this disease as well as a methodology that may 
be applicable to fight emerging pathogens in the future. The 
necessity of this rapid approach is further justified by the fact 
that the SARS virus reservoir has not been identified and 
future viral outbreaks are still possible. Indeed, new cases 
have already been reported in 2004." The SARS coronavirus 
(SARS CoV) genome is a positive strand of RNA with 11 open 
reading frames and a genomic organization similar to other 
coronaviruses. However, phylogenic analysis and sequence 
alignments have shown that the SARS CoV has developed 
distinctly from other coronaviruses.” The viral replication 
proceeds via synthesis of a complementary minus strand 
RNA using the genome as a template and the subsequent 
synthesis of genomic plus strand RNA from the minus strand 
RNA template. The key enzyme responsible for both steps is 
the RNA-dependent RNA polymerase (RdRp), represented 
by nonstructural protein 12 (nsP12). Since RdRp activity is 
not essential for the host, its inhibition will not cause 
undesirable side effects during therapy. With the enzyme 
characterization in its early stages, a homology model based 
on the amino acid sequence can provide some clues concern- 
ing the residues critical for its function. We have built a 
homology model of the SARS polymerase using the 3D jury 
system.” Recently, during preparation and revision of this 
study, Xu et al.* reported an RdRp model using the program 
Modeller. Although their and our models have been built 
using a different approach, the r.m.s between the two models 
is about 1.6 A (for Ca atoms). The similarity between the two 
models can be partially explained by the fact that the 3D-jury 
system uses a set of prediction methods that includes the 
Pcons method on which the Modeller program is based on. 
The capability of the 3D-jury in using consensus between 
Meta prediction methods makes it a very reliable tool for 
model building as proven by its highest ranking in the 
CASP5 (Critical Assessment of Techniques for Protein Struc- 
ture Prediction) competition. This predictor has been used 
recently to successfully assign a methyl-transferase function 
to nsP13 and to build a 3D model that identified residues 
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critical for the enzyme function.® Substrate and ligand 
docking in the active site were carried out with the Genetic 
Optimization Ligand Docking Program (Gold) giving us an 
estimate of ligand binding in the active site. This docking 
method, with a success rate of around 70% through its 
validation process, proved to be very reliable when used with 
caution.” 


MATERIALS AND METHODS 


In order to identify homologues for the nsP12 protein, 
the sequence (ID_NP828869) was initially submitted to 
the BLAST® search engine. However, as this did not yield 
any significant results, we then used a more powerful 
consensus structure prediction method (meta-predictor), 
the 3D-jury system. We submitted the sequence to the 
3D-jury Meta predictor system. All models with a 3D-jury 
score higher than 50 are considered to be significant. A 
three-dimensional model for the SARS nsP12 comprising 
474 residues was built based on the homologous rabbit 
Hemorrhagic Disease Virus RdR polymerase® (pdb ID: 
1KHW_B), which had a 3D-jury score of 165. The final 
model was built using the GIGSAW program?® and its 
potential energy minimized using the GROMOS96 pro- 
gram.'' The model quality was evaluated with Procheck’” 
and then analyzed. The pdb ID for the model is 1SXF. 
Ligand docking in the active site was carried out with the 
Genetic Optimization Ligand Docking program (Gold).” 
The ChemScore’® and AG, i,.aing Were evaluated to discrimi- 
nate between ligands. The ChemScore value is obtained by 
adding a clash penalty and internal torsion terms to 
AGpinaing) Which militate against close contacts in docking 
and poor internal conformation."* 


RESULTS AND DISCUSSION 


As the sequence alignment search for SARS RdRp 
conducted with BLAST did not yield a homologous struc- 
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A: Ribbon diagram of SARS RNA-dependent RNA polymerase. The characteristic fold of a “right 


hand’ fingers, palm, and thumb can be recognized from residues 396 to 870 of nsP 12. B: Structural detail of the 
active site of SARS RNA-dependent RNA polymerase. The catalytic residues critical for the nucleotidyl transfer 
reactions are R553 and R555 and K551. D760 and D761 are responsible for the coordination of the metal ion. 
C: Structure of nucleoside analogs: derivatives from molecule X are XA1: R1 = OH; R2 = CH,OH, XA2; R1 = 
CH,OH; R2 = CH,OH. D: Nucleoside analogue, 2’-O-methylcytidine and uridine triphosphate (UTP) docking 


in SARS RdR polymerase using GOLD. 


ture, we turned to the 3D jury Meta predictor to identify a 
homology model. The power of this method is illustrated by 
the fact that even with less than 10% sequence identity, 
the Rabbit Hemorrhagic Disease Virus (RHDV) RdR poly- 
merase,® was identified as a homologous structure, based on 
a combination of motif search, fold recognition, and structure- 
based alignment orchestrated by the 3D-Jury Meta server. 
The RdRp function was assigned to a C-terminal segment of 
587 residues out of the nsP12 932 residues with a highly 
significant score (3D-jury score > 164 in our work). No 
characteristic motifs suggesting specific functions were found 
for the 396 N-terminal residues. Subsequently, a three- 


dimensional structure for the nsP12 SARS RdRp (residues 
395 to 870) was built based on the homologue’s structure. 
Our model has 78% of residues in the favored region of 
Ramachandran plot and the proportion reaches 98.8% when 
we consider residues in the allowed regions. 

The SARS-CoV nsP12 folds in a classic “right hand” shape 
that contains motifs shared by all RdRps in which the 
characteristic fingers, palm, and thumb sub-domains can be 
recognized [Fig. 1(A)]. The finger domain is composed of 
three helices (residues 462—477, 561-584, and 619-637), the 
RdRp characteristic helix-loop-helix (residues 619-658), and 
two stranded £-sheets (residues 535-541and 551-560). The 
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TABLE I. Docking of Substrate and Nucleoside Analogues 
to SARS-CoV RdR Polymerase Active Site’ 


AGpinaing 
Ligand ChemScore (kJ - mol?) 
Uridine triphosphate 19.77 —20.3 
2'-C-Methyladenosine 18.27 —18.8 
2'-O-Methylcytidine 15.27 —15.4 
Molecule XA1 16.52 -17.4 
Molecule XA2 19.77 —21.0 


*ChemScore = AG) inaing + Clash penalty + internal torsion terms.”° 


palm domain consists of two helices (residues 687—706 and 
776-793) and the B-hairpin (residues 755-769) that contain 
the catalytic aspartates 760 and 761 responsible for the 
nucleotide transfer reaction. This domain contains the con- 
served “SDD” sequence in the RdRPs.'* The thumb domain 
consists essentially of two a-helices (residues 833-843 and 
854-869) and a large loop between residues 798-818. The 
thumb and finger domains are interconnected with loops 
running between them, as is the case for the Hepatitis C 
virus’? and RHDV RdRps.® The active site is buried in the 
center of the protein. It has an architecture very similar to 
other RdRps [Fig. 1(B)]. We can identify all of the important 
residues by sequence homology. Sequence alignment of the 
SARS-CoV polymerase with other known coronavirus poly- 
merases showed a complete conservation of the basic resi- 
dues (K545, R553, R555, and K551) and aspartates 760 and 
761.1© These aspartate residues are responsible for the 
coordination of the metal ion involved in the nucleotidyl 
transfer reaction.? In addition, residue R836, which is par- 
tially conserved in RdRp, contributes to nucleoside binding 
by making a hydrogen bond with the O4 of the cytidine 
moiety. 

The present RdRp 3D-model highlighted by the domain 
structure and by the active site architecture can provide a 
rationale for the design of inhibitors for the synthesis of 
viral RNA. In fact, there are presently no data on SARS 
coronavirus polymerase inhibition. In this context, we 
used data on the inhibition of other related enzymes, most 
notably hepatitis C virus (HCV) polymerase, to help us to 
design anti-SARS agents. It has been shown that nucleo- 
tides can bind to the HCV polymerase catalytic site even in 
the absence of the template’’ and that nucleoside ana- 
logues such as 2’-C-methyladenosine and 2'-O-methylcyti- 
dine are potent competitive inhibitors.*”1* 

Starting from our RdRp model, we carried out the 
docking of the substrate UTP and nucleoside analogues. 
CTP, UTP, and GTP have a similar Km as determined for 
the hepatitis C virus NS5B RNA-dependent RNA polymer- 
ase.'? As the active site determinant residues are con- 
served in the SARS CoV enzyme, we should expect very 
similar binding pattern for the nucleosides. The AG); ning 
value shown in Table I demonstrates as expected a some- 
what lower binding energy for the substrate UTP com- 
pared to 2’-O-methylcytidine and 2’-C-methyladenosine. 

Based on our nucleoside analogue bound models, we 
have designed several compounds and determined their 


binding energy. Efforts were made to create more binding 
interactions between the protein and the nucleoside ana- 
logue through the modification of the latter’s. The most 
promising result has been obtained with thymidine substi- 
tuted 2’-O-methylcytidines. These are molecules X1A and 
XA2, as shown on Figure 1©, where in XA1: R1 = OH; 
R2 = CH,OH8; and in XA2: R1 = CH,OH; R2 = CH,OH. 
The computed binding energy (AG, ;,aing) of molecule XA2 
is 40% lower than the lead compound and is even slightly 
better than substrate UTP (Table I). 

This encouraging result prompts us to continue with the 
modeling and synthesis of this compound. Such attempts 
are important in advancing the design of potent drugs for 
the SARS therapy. 
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